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Abstract 

In an attempt to improve on our earlier W3 theory [J. Chem. Phys. 120, 4129 (2004)] we 
consider such refinements as more accurate estimates for the contribution of connected quadruple 
excitations (T4), inclusion of connected quintuple excitations (T5), diagonal Born-Oppenheimer 
corrections (DBOC), and improved basis set extrapolation procedures. Revised experimental data 
for validation purposes were obtained from the latest version of the ATcT (Active Thermochemical 
Tables) Thermochemical Network. The recent CCSDT(Q) method offers a cost-effective way of 
estimating T4, but is insufficient by itself if the molecule exhibits some nondynamical correlation. 
The latter considerably slows down basis set convergence for T4, and anomalous basis set 
convergence in highly polar systems makes two-point extrapolation procedures unusable. However, 
we found that the CCSDTQ— CCSDT(Q) difference converges quite rapidly with the basis set, and 
that the formula 1.10[CCSDT(Q)/cc-pVTZ+CCSDTQ/cc-pVDZ-CCSDT(Q)/cc-pVDZ] offers a 
very reliable as well as fairly cost-effective estimate of the basis set limit T4 contribution. The 
T5 contribution converges very rapidly with the basis set, and even a simple double-zeta basis 
set appears to be adequate. The largest T5 contribution found in the present work is on the 
order of 0.5 kcal/mol (for ozone). DBOC corrections are significant at the 0.1 kcal/mol level 
in hydride systems. Post-CCSD(T) contributions to the core-valence correlation energy are only 
significant at that level in systems with severe nondynamical correlation effects. Based on the 
accumulated experience, a new computational thermochemistry protocol for first-and second- 
row main-group systems, to be known as W4 theory, is proposed. Its computational cost is 
not insurmountably higher than that of the earlier W3 theory, while performance is markedly 
superior. Our W4 atomization energies for a number of key species are in excellent agreement 
(better than 0.1 kcal/mol on average, 95% confidence intervals narrower than 1 kJ/mol) with the 
latest experimental data obtained from Active Thermochemical Tables. Lower-cost variants are 
proposed: the sequence Wl— >W2.2^W3.2— >W41ite^W4 is proposed as a converging hierarchy of 
computational thermochemistry methods. A simple a priori estimate for the importance of post- 
CCSD(T) correlation contributions (and hence a pessimistic estimate for the error in a W2-type 
calculation) is proposed. 
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I. INTRODUCTION 



In the past fifteen years, computational thermochemistry has matured to the point where 
its accuracy is often competitive with all but the most accurate experimental techniques. 

A compact overview of computational thermochemistry methods in all their variety has 
very recently been published by one of usjl| , while a book with more detailed reviews of the 
various techniques was published in 2001 [2 1 . In terms of 'ready-made' nonempirical small- 
molecule methods of sub-kcal/mol accuracy, there have been two major developments in the 
last few years. One is the Wn family of computational thermochemistry protocols (to be 



discussed below) 



sag 



the other has been the HEAT (Highly accurate Extrapolated Ab 



initio The rm oche mistry ) project b y a .nultinationa! gr oup of research . In this context, 
mention should be made of the related 'focal point approach' pioneered by Allen[7| — which 
is however more a general strategy than a precisely defined computational protocol — as well 
as of the configuration interaction extrapolation based work of Bytautas and Ruedenberg g| . 
The 'Wn theory' naming scheme was introduced by. analogy to the 'Gn theory' family 



of methods of the late lamented Pople and coworkers^]. The basic philosophy of the Wn 
family of methods can be outlined as follows: 

• All terms in the Hamiltonian that can reasonably contribute at the kJ/mol level to 
the atomization energy should be retained; 

• Basis set convergence is established for each contribution individually, and the smallest 
basis sets are used for each that still lead to acceptable basis set incompleteness errors 
for the relevant contribution; 

• As a result, computational effort is kept down to the minimum consistent with the 
required accuracy; 

• No parameters derived from experiment are employed: where possible, physically or 
empirically rational basis set extrapolations are employed. 

Wl theory 0,Q uses basis sets of no larger than spdfg quality, and no electron correlation 
methods more elaborate than CCSD(T). It represents an approximation to the relativistic, 
clamped-nuclei, basis set limit CCSD(T) energy. (Scalar relativistic corrections were 
obtained as one-electron Darwin and mass-velocity terms from averaged coupled pair 
wavef unctions, although the implementation in Gaussian 03 [n| employs Douglas-Kroll scalar 



relativistics Q Q at the CCSD(T) level.) W2 theory^ Q aims at the same target as 
Wl theory but uses more elaborate spdfgh basis set and is in general more accurate. For 
systems dominated by a single reference determinant, W2 theory can usually achieve kJ/mol 
accuracy. 

For systems with significant nondynamical correlation, the CCSD(T) limit differs 
significantly from the FCI (full configuration interaction) limit. Even for systems like the 
diatomics N 2 , O2, and F 2 , W2 will be in error by 0.5-0.7 kcal/mol, and for ozone an error of 



3 kcal/mol is seenj^. Two main improvements were introduced in W3 theory. The first was 

a more robust scalar relativistic correction based on DKH-CCSD(T) calculations (Douglas- 
flf] 

Kroll-Hess|ll|, 112J), with an eye to future extension of applicability to elements heavier than 
Ar. The second, which proved crucial, was an account for higher-order T3 effects — i.e., the 
CCSDT— CCSD(T) difference — on the one hand, and for connected quadruple excitations 
T4 on the other hand. W3 proved much more robust to nondynamical correlation effects than 
its predecessors for systems dominated by a single reference determinant, it is comparable 
in accuracy to W2 theory. 

Attempts to surpass W3- level accuracy (RMSD of about 1.2 kJ/mol, 95% confidence 
interval of about 2.5 kJ/mol) were impeded by a number of problems, of which we shall cite 
only the two most important ones. On the one hand, it would definitely be necessary to 
consider connected quadruple excitation effects with basis sets larger than spd quality, but 
this was precluded by the limiting n 4 N 6 computational cost scaling (where n and N are the 
numbers of electrons and basis functions, respectively) of CCSDTQ calculations. On the 
other hand, we were working in an accuracy regime comparable to that of all but the very 
best experimental thermochemical data, and meaningful comparisons with experiment were 
just not possible beyond the kJ/mol range, except for a few select molecules. 

The ™-t develops of the CCSDT (Q ) m ethodQ and of a code for arbitrary 
quasiperturbative techniques [l^ opened a potential avenue for more cost-effective 
treatments of T4 on the theoretical front. In terms of availability of accurate and reliable 
thermochemical benchmarks, the advent of Active Thermochemical Tables afforded relief. 

Active Thermochemical Tables (ATcT) are a new paradigm of how to obtain accurate, 
reliable, and internally consistent thermochemical values by using all available knowledge 3, 
Q 0, and overcome the limitations that are deeply engrained in the traditional 
approach to thermochemistry, such as that used in all traditional thermochemical 
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compilations. As opposed to the traditional sequential approach, ATcT derives its results 
from a Thermochemical Network (TN). Where available, the thermochemical values used 
in the present work for the purpose of benchmarking the W4 method have been obtained 
from the latest version of the Core (Argonne) Thermochemical Network, C(A)TN, that is 
currently under development and encompasses ~700 chemical species containing H, O, C, N, 
and halogens, which are interlinked by >7000 thermochemically-relevant determinations [19]. 
In addition, the benchmark ATcT values for three sulfur-containing species have been 
obtained from a separate adjunct ATcT TN. The adjunct TN is currently under intense 
development and scrutiny as part of a joint Argonne- NIST project H, 

which is attempting 

to address and resolve some of the inherent inconsistencies surrounding the accepted 
experimental thermochemical values for several key sulfur-containing species. Consequently, 
within the context of the present work, we have limited ourselves to using as benchmarks 
only three sulfur- containing species from ATcT (H 2 S, SO and SO2), whose values happen 
to be invariant (within 0.05 kcal/mol or better) to the finer details of the adjunct sulfur- 
containing ATcT TN. For the species of interest here, the current version of C(A)TN includes 
all available experimental results and also considers a selection of prior highly accurate 
theoretical results (with weights proportional to the expected uncertainties), but, in order 
to keep the ATcT benchmarks used here independent of the current computational results, 
does not include W4. These will be added to the Thermochemical Network in a subsequent 
revision of C(A)TN. Full details of how the ATcT values were developed and what data are 
they based on will be published separately in a forthcoming series of papers^!]. 
In the present work, we will: 

• explore basis set convergence of connected quadruple and quintuple excitations in 
greater detail; 

• consider the effect of diagonal Born-Oppenheimer corrections; 

• present a computational protocol called W4 theory which yields significant 
improvements over W3 theory, for a tractable additional computational cost; 

• validate it against the best available benchmark values from ATcT; 

• present a simple energy-based diagnostic for the reliability of thermochemical 
approaches that neglect post-CCSD(T) correlation. 
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II. COMPUTATIONAL DETAILS 



All quantum mechanical calculations were carried out on the Linux cluster of the Martin 
group at the Weizmann Institute. In practice, most results were obtained from four dual- 
processor 32-bit (Intel Xeon 2.8 and 3.06 GHz) and a single four-processor 64-bit (AMD 
Opteron 846) machine, all custom-built by Access Technologies of Rehovot, Israel. These 
machines are equipped with 4-way and 8-way striped disk arrays, respectively, made up of 72 
GB Ultra320 SCSI disks. The high sustained I/O throughputs of these machines |23f proved 
essential to handle the daunting I/O requirements for many of the calculations reported. 

All CCSD and CCSD(T) calculations were carried out using MOLPRO 2002^ 
Conventional, rather than direct, algorithms were used throughout as this proved more 
efficient with the special I/O hardware available. 

The DBOC calculations, and some full CCSDT calculations, were carried out using the 
Austin-Mainz-Budapest version of ACES II J^fJ. The remaining post-CCSD(T) calculations 
- CCSDT(Q), CCSDTQ, CCSDTQ(5), CCSDTQ5, CCSDTQ5(6), CCSDTQ56, and full 
CI — were carried out by means of the general coupled cluster code MRCC of Kallay and 
coworkers [3]. The required integrals and SCF orbitals for the latter were obtained using 
ACES II. 

The basis sets used are all of the correlation consistent family of Dunning and 



coworkers 



271 ]. For the large-scale CCSD(T) calculation, we combined regular cc-pVnZ 



basis sets (n=D,T,Q,5,6) on hyd rore n with aug-cc-pVnZ ([diffuse-function] augmented 
polarization consistent) basis sets|28] on boron through fluorine, and Wilson's aug-cc- 
pV(n+<i)Z basis sets[29] on phosphorus through chlorine. The latter contain high-exponent d 
functions to cope with 'inner polarization' effects 3(|: these occur in second-row elements in 



high oxidation states as a result of back-bonding into their low-lying d orbitals [3JJ. For 
post-CCSD(T) calculations, regular cc-pVnZ basis sets were employed unless indicated 
otherwise. In core-valence correlation calculations, we employed Peterson's core-valence 
weighted correlation consistent basis sets[3], aug-cc-pwCVnZ. Finally, the scalar relativistic 
calculations were carried out using both the unpublished Douglas-Kroll optimized correlation 
consistent basis sets by Oren and Martin 33] and the PNNL (Pacific Northwest National 
Laboratory) relativistically contracted correlation consistent basis sets]34]. (We verified 
that fundamentally the same results were obtained.) 
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reference geometries are obtained at the CCSD(T)/cc-pV(Q+d)Z level. Complete 
^geometries and total energies are available as supporting information to the present 



III. RESULTS AND DISCUSSION 

A. Basis set convergence of post-CCSDT correlation contributions 

In order to get a clearer picture of basis set convergence behavior in post-CCSDT 
contributions, we considered in detail a set of 22 mostly diatomic molecules, namely, HF, 
H 2 0, NH 3 , PH 3 , H 2 S, HC1, CO, CS, Cl 2 , C1F, N 2 , B 2 , CH, CN, NO, SO, 2 , F 2 , C 2 , BN, 
BeO, and MgO. They were chosen in such a way as to span the gamut from dominated by 
a single reference configuration to pathological nondynamical correlation. Because of their 
small size, consideration of fairly large basis sets is possible. Results can be found in Table HJ 

Let us first consider quasiperturbative connected quadruples, which we were able to 
obtain with cc-pVQZ basis sets for all systems. Considerable basis set sensitivity was seen, 
especially for systems with significant nondynamical correlation. The data give us reason to 
believe cc-pVQZ is close to basis set convergence, but this is clearly not a practically feasible 
basis set for larger systems. 

Generally the (Q) contributions increase monotonically with the basis set. However, 
in very polar systems (H 2 and HF, with MgO being an extreme case) anomalous 
nonmonotonic convergence is observed. The problem appears to largely go away if aug- 
cc-pVnZ basis sets are used on O and F. However, this may not be something one can do 
on a routine basis. 

As we noted before for T4 overall, extrapolation from cc-pVDZ and cc-pVTZ basis 
sets works well in some cases, but does more harm than good in cases like H 2 0, HF, and 
especially MgO. 

Scaling the cc-pVDZ result — analogous to the scaled CCSDTQ/cc-pVDZ— CCSDT/cc- 
pVDZ difference in W3 theory — does less harm but is clearly a rather crude approximation. 
However, there is a very high statistical correlation (R 2 = 0.9945) between the cc-pVTZ 
values and the numbers extrapolated from cc-pVTZ and cc-pVQZ basis sets. 

As expected, the CCSDTQ— CCSDT(Q) difference is quite small in systems dominated 
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by a single reference configuration (like HF and H 2 0). However, it becomes rather significant 
in systems with moderate to strong nondynamical correlation. This casts some doubt on 
the general applicability of CCSDT(Q) on its own as a post-CCSDT correction. 

Fortunately, basis set convergence of the CCSDTQ— CCSDT(Q) difference is much more 
rapid than for the (Q) contribution, and even cc-pVDZ generally appears to be sufficient. 
This naturally suggests a simple additivity approximation: 

AE(f 4 ) « ci(£[CCSDT(Q)/cc-pVTZ] - £[CCSDT/cc-pVTZ] + 

+£[CCSDTQ/cc-pVDZ] - £[CCSDT(Q)/cc-pVDZ]) (1) 

We found that this approximation works very well when fitted against the extrapolated basis 
set limit data. Eliminating the anomalous cases of BeO and MgO, we find ci = 1.10 to be 
optimal with R 2 = 0.997. We also found that, in practice, all the steps in this additivity 
approximation are computationally feasible if CCSDTQ/cc-pVDZ is feasible — that is, any 
system tractable at the W3 level will be amenable to this improved connected quadruples 
term. 

What about connected quintuple excitations? Results can be found in Table |nj We find 
that the CCSDTQ5— CCSDTQ difference converges quite rapidly with the basis set, and 
that even a simple double- zeta basis set comes quite close to the presumed basis set limit. 
In our experience, a CCSDTQ5/DZ calculation is feasible for any system that we were able 
to do CCSDTQ/cc-pVDZ for. 

How useful are quasiperturbative techniques here? We find the CCSDTQ5— CCSDTQ(5) 
difference to be essentially negligible in systems dominated by a single reference 
configuration. Even with fairly mild nondynamical correlation, however, this difference 
behaves somewhat erratically. If moderate to strong static correlation is present, however, 
we see exaggerated (5) contributions, with CCSDTQ5— CCSDTQ (5) being large in the 
opposite direction and T 5 overall thus rather modest (albeit significant enough for inclusion 
in anything purporting to improve on W3 theory). 

Finally, the largest Tq contribution (Table ITTj) is seen for singlet C2 (about 0.06 kcal/mol). 
In less pathologically multireference systems, it can be neglected for all but the most accurate 
work (which would also require much greater accuracy in all other contributions). For 
systems with no or only mild static correlation, the quasiperturbative CCSDTQ5(6) method 
reproduces essentially the entire effect: in cases with severe nondynamical correlation, we 
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note again that (6) tends to exaggerate the effect. 



B. Connected quadruple excitations for the W3 set 

We now turn to the W3 set of molecules. Connected quadruple excitation contributions 
to the total atomization energy can once again be found in Table U] The largest contribution 
is found for ozone (3.81 kcal/mol) but it exceeds 2 kcal/mol for several other molecules, and 1 
kcal/mol for surpri sing ly many. It was conjectured by Bak et al. and confirmed repeatedly 
before (e.g. 0, [3, Q, S|) that the main reason for the good performance of CCSD(T) 
for many systems is error compensation between neglect of higher-order T 3 effects (which 
generally decrease the atomization energy) and complete neglect of T 4 (which systematically 
increases atomization energies). This is once more confirmed here. 

Differences between the W3 estimate for T4 effects that is, 

1.25(£[CCSDTQ/cc-pVDZ] - £[CCSDT/cc-pVDZ]) and Eq.flU) may reach 0.5 
kcal/mol and are somewhat erratic in character. The W3 estimate was a stop-gap measure 
at the time for want of a better yet sufficiently cost-effective alternative. Eq|T] appears to 
have made it obsolete. 



C. Connected quintuple and higher excitations 

Connected quintuple excitation contributions to the total atomization energy for the W3 
set of molecules can be found in Table |n] The largest contribution found is for ozone, 0.41 
kcal/mol. With one exception (OCS, decrease by 0.01 kcal/mol), T 5 systematically increases 
the binding energy. 

We were unable to obtain iterative T 6 corrections for many of the systems. In light of 
what was noted for the diatomics above, and in light of the quasiperturbative T 6 corrections 
which we were able to obtain for most systems, T 6 contributions are expected to be negligible 
compared to other intrinsic errors in our calculations. 
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D. Diagonal Born-Oppenheimer corrections 



Valeev and Sherrillj40j studied correlation effects on diagonal Born-Oppenheimer 
corrections in some detail, and concluded that they should be very small for relative energies, 
and that the DBOCs should be adequately reproduced at the Hartree-Fock level with a basis 
set of at least AVTZ quality. This is the level of theory considered by us. Contributions can 
be found in Table HVl (ROHF/AVDZ values are also given there, in order to show that the 
ROHF/AVTZ values are converged with respect to the basis set.) 

As expected, they are most important for the hydrides, and can reach or exceed 0.1 
kcal/mol if multiple hydrogens are present. For the benzene molecule (not listed in Table 
IIVJ) . we computed a contribution of 0.24 kcal/mol. For semirigid closed-shell molecules, 
DBOC appears to systematically increase the binding energy, but decreases are seen for 
some open-shell and less rigid molecules. 

E. Larger basis sets for valence higher-order T3 contributions 

In W3 theory, the CCSDT-CCSD(T) difference was extrapolated from cc-pVDZ and 
cc-pVTZ basis sets. In this work, we considered whether it would be worthwhile to 
extrapolate this contribution from larger cc-pVTZ and cc-pVQZ basis sets (at the expense 
of approximately a factor 32 in computational effort). As can be seen in Table HTT1 the effect 
is in the 0.01 kcal/mol range in most cases, with B 2 (—0.11 kcal/mol), PH 3 (0.08 kcal/mol), 
and O2 (0.07 kcal/mol) being the exceptions that prove the rule. We have therefore elected 
to retain cc-pVDZ and cc-pVTZ basis sets for this contribution. 

F. Post-CCSD(T) contributions to the core-valence contribution 

It was shown in the original Wl paper [3J] that connected triple excitations are quite 
important for the core- valence contribution, accounting for as much as half the total. It can 
then not a priori be ruled out that post-CCSD(T) contributions to the inner-shell correlation 
energy would be non-negligible. Limited anecdoctal evidence gathered in the W3 paper [^] 
suggests possible contributions on the order of 0.05-0.10 kcal/mol, which might well be 
relevant for our purposes. Clearly, this point bears further scrutiny. 

In addition, CCSDT-CCSD(T) differential core-valence contributions may resolve an 
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ambiguity resulting from different CCSD(T) definitions. (See Appendix 1 for a more 
detailed discussion.) Briefly, closed-shell CCSD(T) and CCSDT are uniquely defined, as 
are UCCSD(T) and UCCSDT, regardless of whether electrons are frozen. By contrast, 
ROCCSD(T) is only uniquely defined if all electrons are correlated. If some are 
'frozen', two ambiguities arise: (1) whether standard or semicanonical orbitals are used; 
(2) if the latter, whether the 'frozen core' orbitals are included or excluded from the 
semicanonicalization. In practice, performing ROCCSD(T) in a basis of standard orbitals 
requires an additional 0(N 7 ) step, so therefore the two main practical implementations 
(Watts-Gauss-Bartlett j^, a.k.a. ACES II, and Werner-Hampel-Knowles . a.k.a. 
MOLPRO) use semicanonicalization. Their mutual nonequivalency arises on point (2), in 
that ACES II includes the frozen core orbitals in the semicanonicalization, while MOLPRO 
excludes them. 

ROCCSDT with all electrons correlated would be devoid of such ambiguities. However, 
basis set limit calculations at this level are manifestly not a practical approach. Rather, we 
shall apply the following additivity approximation: 



E [CCSDT] ST* « £[CCSD(T)™] + £[CCSDT - CCSD(T)]^ 

U - CCSD(T) val ] 



+ £[CCSD(T) all -CCSD(T\ llimit3 



+ £[CCSDT all - CCSD(T) all - CCSDT val + CCSD(T) val ] limit4 (2) 

The first term is the sum of our SCF, CCSD, and (T) limits. The second term is 
our valence T3— (T) correction. The third term is our calculated core- valence correction, 
while the fourth and final term is the differential T3— (T) contribution to the core- valence 
correction. The different 'limit' labels reflect the fact that each component is obtained at a 
different approximate basis set limit — if the true basis set limit were available in each case 
the 'is approximately equal to' sign would change to an equality. 

The all-electron CCSDT calculations involved in the final term strain our computational 
resources to the limit even for the heavier lst-row systems: they will be effectively impossible 
for most 2nd-row systems. 

In Table IIII| we have gathered values for this final term using ACES II and MOLPRO 
definitions. It can be seen there that the corrections are generally negative for ACES and 
positive for MOLPRO, and that their absolute values are quite small even for significantly 
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multireference systems. Only for pathologically multireference molecules like C2 and BN do 
they even exceed 0.1 kcal/mol. 

The same Table also contains quasiperturbative T4 differential contributions defined as 
follows: 



AQ « £[UCCSDT(Q) - UCCSDT] aU - £[UCCSDT(Q) - UCCSDT] va i 
AAQ = AQatom — AQ mo \ ecu \ e (3) 

atoms 

Differential T4 contributions become somewhat significant for pathologically multirefer- 
ence molecules but are insignificant for our purposes otherwise. 

In summary, the post-CCSD(T) contribution to the core- valence correlation energy - 
at least for first-row systems — appears to be negligibly small in most cases. Its neglect 
will slightly bias TAEs upward if the ACES definition of valence CCSD(T) is used, and 
slightly downward if the MOLPRO definition is used instead. We have therefore elected to 
take an average. Specifically, we shall be using the MOLPRO definition throughout, except 
for adding in one-half the ACES— MOLPRO difference extrapolated from cc-pVDZ and cc- 
pVTZ basis sets. As can be seen in Table 1111} this difference converges extremely rapidly 
with the basis set, and values extrapolated from cc-pV{D,T}Z and cc-pV{T,Q}Z basis sets 
are basically indistinguishable. 



G. Definitions of W4 theory and variants 

For consistency, as long as we are investing the quite formidable computational effort 
required for the improved post-CCSDT corrections, we might want to improve the SCF, 
CCSD(T), and core correlation contributions relative to those used in W2 and W3 theory. 

The use of larger basis sets for the SCF and valence CCSD(T) correlation was already 
considered in some detail in the W3 paper [5]. With the hardware detailed in the Methods 
section, we found that we could carry out CCSD/AV6Z and CCSD(T)/AV5Z calculations 
using conventional algorithms for all the systems considered in the present paper. 

We thus propose the following protocol for W4 theory: 

• the ROHF-SCF contribution is extrapolated from AV5Z and AV6Z basis sets 



using the recently proposed Martin-Karton modification 
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4lj of Jensen's extrapolation 



formula 42] : 

E UF ,l = £hf,oo + A{L + 1) exp(-9v / L) (4) 

the RCCSD valence correlation energy is calculated using AV5Z and AV6Z basis set, 
using the Watts-Gauss-Bartlett definition for open-shell systems. Following the 



suggestion of Klopper[43j, it is partitioned in singlet-coupled pair energies, triplet- 
coupled pair energies, and T\ terms. (The term linear in the single excitations T\ in 
the CCSD equations is nonzero for open-shell CCSD calculations using semicanonical 



orbitals, see, e.g., Ref. 44j.) The 7\ term (which exhibits very weak basis set 
dependence) is simply set equal to that in the largest basis set, while the singlet- 
coupled and triplet-coupled pair energies are extrapolated by the expression 

i E(L)-E(L-1) 

with 0:5=3 and «t=5, and L set equal to the maximum angular momentum present 
in each basis set (i.e., 5 for AV5Z and 6 for AV6Z). These expressions are physically 
motivated by the partial-wave expansion of pair correlation energies in helium-like 



atoms 
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l by tne partial-wave expansion 01 pair correlatic 
as we U as by empirical observation P. IsoT]. 



the (T) valence correlation energy was extrapolated using the same expression with 
a=3, from AVQZ and AV5Z calculations. Note that only a CCSD calculation is 
required in the largest (AV6Z) basis set. For open-shell systems, the Werner-Knowles- 
Hampel (a.k.a. MOLPRO) definition (53] of the restricted open-shell CCSD (T) energy 
is employed throughout, rather than the original Watts-Gauss-Bartlett |4fi| (a.k.a. 
ACES II) definition, unless indicated otherwise; 

the CCSDT— CCSD(T) difference is extrapolated using Eq.(jnj) from 
CCSDT-CCSD(T) differences with PVDZ and PVTZ basis sets 

the T 4 difference was estimated from eq.Q. For this contribution, UHF reference 
determinants are used for want of an restricted open-shell CCSDT(Q) code. In any 
case, RCCSDT and UCCSDT energies are very close for all the systems considered 
here, and we have no reason to believe that there is any significant error introduced in 
this term by spin contamination. One might argue about the use of RCCSDTQ/cc- 
pVDZ versus UCCSDTQ/cc-pVDZ in Eq.(l). We have considered both options. For 
the closed-shell molecules, the results are equivalent to within less than 0.01 kcal/mol, 
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but one can see larger differences for radicals with significant spin contamination. For 
radicals like 2 and N0 2 , the ROCCSDTQ based estimate yields results substantially 
closer to the very precisely known experimental values, and we have therefore retained 
this choice. 

the T5 contribution was estimated from CCSDTQ5/DZ calculations 

the difference between ACES II and MOLPRO definitions of the valence ROCCSD(T) 
definition is computed at the CCSD(T)/cc-pVTZ and CCSD(T)/cc-pVQZ levels and 
extrapolated using Eq.© (with a=3). One-half of this contribution is added to the 
final result. 

the inner-shell correlation contribution was extrapolated using Eq.(j5|) (with a=3) from 
CCSD(T)/aug-cc-pwCVTZ and CCSD(T)/aug-cc-pwCVQZ calculations 

the scalar relativistic contribution is obtained from the difference between nonrela- 
tivistic CCSD(T)/aug-cc-pV(Q+d)Z and second-order Douglas-Kroll CCSD(T)/DK- 
aug-cc-pV(Q+d)Z calculations. We found that there is essentially no difference 
between such contributions obtained using the unpublished Martin-Oren Douglas- 



Kroll optimized basis sets 33] and the publicly available PNNL Douglas-Kroll 
contracted correlation consistent basis setsj^], and have chosen the latter as the 
nonrelativistic result can be recycled from the (T) step. 

• atomic spin-orbit coupling terms are taken from the experimental fine structure. The 
spin-orbit splitting constant for NO — the only molecule considered here with first- 
order spin-orbit coupling — was taken from Ref.j^]]]. It was previously shown 3} that 
both atomic and molecular spin-orbit coupling constants can be computed as well 
without significant loss of accuracy. 

• Finally, diagonal Born-Oppenheimer corrections are obtained from ROHF/AVTZ 
calculations. 

n 

In the HEAT paper (and 'superHEATed' methods discussed therep]), unrestricted 
reference wave functions are used throughout and inner-shell electrons are only frozen in 
the post-CCSD(T) correlation treatments. Only first-row molecules were considered there. 

Finally, we note that for electron affinities — as well as for systems like MgO that are 
both very polar and pathologically multireference in character — the use of augmented basis 



14 



sets in the T 3 — (T) and T 4 steps is very strongly recommended. It was previously shown 
for Na, Mg, and especially K and Ca s yste ms that the inner-valence shell of these metals 
should be added to the valence shell 



Da s yste : 



H. Validation against Active Thermochemical Tables data 

Individual components of the final W4 results can be found in Table A comparison 
between the final W4 values and the Active Thermochemical Tables data (with associated 
95% confidence intervals) is given in Table IVTl |54j | 

A remark is due on the anharmonic zero-point vibrational energies used. As the 
uncertainty in some of the ZPVEs employed in the W1/W2/W3 series of papers is on the 
order of 0.1 kcal/mol, we have re-examined the data available: source details are given in 
the footnotes to Table IVTl For two species, CH3 and CH4, the best available data represent 
revisions in excess of 0.1 kcal/mol. 

For the first-row systems, agreement between W4 and ATcT data can only be described 
as excellent, with an RMSD (root mean square deviation) of 0.083 kcal/mol and a mean 
absolute deviation (MAD) of 0.065 kcal/mol. (This implies a 95% confidence interval of 
about 0.16 kcal/mol.) The mean signed deviation (MSD) is essentially zero, at —0.011 
kcal/mol, suggesting that the method is free of systematic bias. 

If we add in the 2nd-row species, RMSD and MAD go up to 0.15 and 0.09 kcal/mol, 
respectively. However, basically all the extra error is caused by a single molecule, C1CN, for 
which the ATcT value is mostly based on prior theoretical calculations. If we remove this 
data point, RMSD and MAD drop to values equivalent to those for the first row alone, 

Including the remaining W3 species, and using the earlier experimental data compiled in 
Ref. [3] for species lacking an ATcT value, we find both RMSD and MAD to be 0.15 kcal/mol 
with C1CN included, both dropping to 0.10 kcal/mol with C1CN excluded. 

Overall, excepting C1CN, the discrepancy between W4 and experiment reaches or exceeds 
1 kJ/mol for only 2 species: PH 3 (+0.31 kcal/mol) and 3 (-0.24 kcal/mol). For PH 3 , the 
calculated result is still within the experimental error bar (±0.41 kcal/mol). 

For the first-row and part of the second-row species, we can add in the 
CCSDT/cc-pCVTZ— CCSD(T)/cc-pCVTZ differential core- valence contribution to the total 
atomization energy, thus obtaining what we term W4.2 theory. (For second-row elements, 
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the cc-pwCVTZ basis set was used.) Here, differences between ACES and MOLPRO 
definitions of CCSD(T) are on the order of 0.01 kcal/mol or less, as they ought to be. For the 
ATcT lst-row systems, W4.2 theory represents a slight improvement over W4 theory: the 
W4.2 error statistics are RMSD=0.067, MSD=+0.003, MAD=0.056 kcal/mol. (This implies 
a 95% confidence interval of about 0.13 kcal/mol.) As expected, the largest improvement 
is seen for ozone (from 0.24 to 0.14 kcal/mol): we expect that this error would be reduced 
further if a differential T 4 contribution to the core correlation could be included and/or a 
valence T 6 contribution could be added. The former requires about 92 billion determinants 
in the cc-pCVTZ basis set, which is sadly beyond our available computational resources. 
(An at best semiquantitative estimate can be obtained at the CCSDT(Q)/cc-pCVDZ level: 
this suggests a narrowing of the gap by 0.04 kcal/mol, to a rather pleasing 0.1 kcal/mol.) As 
for the latter, a crude estimate of the T 6 contribution can be obtained as one-fourth the T 5 
contribution (see Section Till Jl below) . i.e., a further increase by 0.1 kcal/mol which would 
bridge the gap essentially completely. 

At the W4.2 level, our biggest remaining error is for CI2 (-0.17 kcal/mol). Using larger 
basis set CCSDT— CCSD(T) and T4 corrections reduces the discrepancy to -0.12 kcal/mol, 
which is further reduced to -0.10 kcal/mol when a larger basis set is used for T 5 as well. 

Based on this observation, we are quoting an additional set of results, labeled 'W4.3' 
in the Table. These represent the results of more rigorous accounting for valence post- 
CCSD(T) effects — all other contributions are the same as for W4.2. Specifically, we are 
(a) using our 'best estimate' valence T4 contributions; (b) CCSDT-CCSD(T) is extrapolated 
from cc-pV{T,Q}Z basis sets; (c) the T5 contribution is obtained with the cc-pVDZ basis 
set; (d) an approximate T 6 contribution was obtained at the CCSDTQ5(6)/cc-pVDZ level. 

Relative to W4.2, the performance of W4.3 is somewhat mixed. For some systems - 
notably O2, CI2, OF, and to a lesser extent, CO — agreement with experiment is markedly 
improved, for others (notably N2) it deteriorates. For the remainder of the systems, the 
W4.2-W4.3 difference is either too small to affect anything, or the two values err on 
opposite sides of experiment. In all probability, for a W4.3 type method to yield any 
further improvement, the valence CCSD extrapolations would need to be rendered still more 
rigorous, as we believe the W4.2-W4.3 differences are on the order of the valence CCSD 
uncertainty in a number of our systems. 

Finally, for some systems we are actually able to add the (Q)/cc-pCVTZ contribution 
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to the core-valence correlation term. We have not tabulated these results, as they do not 
appreciably affect the systems given in the Table for which we could obtain them. Their 
inclusion might be worthwhile in systems like C2 or BN, or — as noted above, and if the 
daunting computational requirements could be met — ozone. 

I. An improved and more cost-effective W3 theory 

Stanton and coworkers have arguedQ] that CCSDT(Q) might be closer to FCI than 
CCSDTQ, for similar reasons as in the case of CCSD(T) vs. CCSDT0- 

This begs the 

question if one could not derive a lower-cost variant of W3 theory involving CCSDT(Q)/cc- 
pVDZ. 

Eliminating the anomalous cases of BeO and MgO, we find that 
1.025(£[CCSDT(T)/cc - pVDZ] - £[CCSDT/cc - pVDZ]) reproduces the W4 estimate 
of post-CCSDT correlation effects with R 2 = 0.974, the largest deviations being seen for 
the B2 and CS molecules. When these latter two systems are eliminated, the coefficient 
changes to 1.012 (only semantically different from unity for our purposes), with R 2 = 0.991. 
Carrying out regression instead to our 'best estimate' post-CCSDT contributions, we find 
three clear outliers (B2, CS, and CI2): upon eliminatingthem, we find a coefficient of 1.000 
with R 2 =0.990. We thus essentially recover the HEAT 6] post-CCSDT contribution. 

The component breakdown for W3.2 theory is given in the Supporting Information: 
the final W3.2 results can be found in Table IVI1 compared with W4 and experiment. 
Occasionally a substantial deviation from W4 theory is seen (notably for B 2 , somewhat 
less so for CS) but by and large, W3.2 theory appears to be closer to W4 theory than the 
original W3 theory. 

We might then define W2.2 theory as W3.2 theory without the post-CCSD(T) 
contributions. 

For the ATcT species minus C1CN, W3.2 theory obtains an RMSD=0.16 kcal/mol and 
MAD=0.12 kcal/mol. This is competitive with W3 theory itself, at considerably reduced 
computational cost. Neglecting post-CCSD(T) correlation effects altogether, we obtain W2.2 
theory. The latter does very well, as expected, for species dominated by a single reference 
determinant, but yields unacceptable errors for species like N 2 0, NO2, and especially ozone. 

Finally, let us consider W4 theory in which just the post-CCSDT contribution is 
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approximated as unsealed (Q)/cc-pVDZ; "W41ite", as it were. Its performance is 
intermediate between W3.2 and full W4: for the ATcT species less C1CN, MAD=0.09 and 
RMSD=0.12 kcal/mol. The performance differential with W3 is primarily due to the larger 
basis sets employed in the various CCSD(T) level contributions. "W41ite" may be applicable 
to some systems where CCSDT(Q)/cc-pVTZ and/or CCSDTQ/cc-pVDZ are prohibitive in 
computational cost. 

A proposed convergent hierarchy of methods (at increasing cost) might thus be Wl — > 
W2.2 -> W3.2 -> W41ite -> W4. 



J. A simple energy-based diagnostic for nondynamical correlation effects 

A number of diagnostics have been proposed for nondynamical correlation character of 
a system, such as the T\ diagnostic of Lee and Taylor [59] and the T>\ and £> 2 diagnostics of 
Nielsen and Janssenjsfj|. Other researchers (particularly those involved with multireference 
methods) consider such indicators as the largest T 2 amplitudes or the HOMO and LUMO 
natural orbital occupations. Such diagnostics are gathered in Table IIX1 together with 
some energy-based quantities — specifically, the percentages of the total atomization energy 
accounted for by SCF, (T) triples, and T 4 + T 5 . 

The latter is probably the best yardstick for imperfections in CCSD(T) for our purposes, 
but of course it is an a posteriori criterion. An a priori indicator for whether the formidable 
computational effort entailed by post-CCSD(T) methods is necessary would be highly 
desirable. 

Of the various diagnostics displayed in Table HXl the %TAE[(T)] criterion appears to be 
the best predictor for %TAE[T 4 + T5], with a squared correlation coefficient R 2 = 0.791. 
If the somewhat anomalous systems BeO and MgO are deleted, we find a linear regression 
%TAE[T 4 + T 5 ]^0.126%TAE[(T)], with the squared correlation coefficient increasing to a 
respectable i? 2 =0.941. (The regression slope including BeO and MgO is 0.11.) 

The second-best predictor appears to be %TAE[SCF]. With BeO and MgO eliminated, 
we find a linear regression %TAE[T 4 +T 5 ] « -0.0199%TAE[SCF] + 1.5997, with R 2 = 0.810. 

The following approximate predictors for specific higher-order excitations can be obtained 
in the same manner: %TAE[Ts] rs 0.095 %TAE[T 4 ] (eliminating just OCS as an outlier, 
i? 2 =0.922) and %TAE[T 6 ] « 0.24 %TAE[T 5 ] (no outliers, #2=0.917). While these are 
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obviously no replacement for explicit calculations, they are quite useful in establishing 
whether a post-CCSD(T) approach such as W3 or W4 theory is required in the first place. 

Note also that, as an error estimate for W2 theory, 0.126%TAE[(T)] is rather pessimistic, 
as %TAE[T4 + T5] will be compensated to greater or lesser extent by higher-order connected 
triple excitations. 

Turning now to the other diagnostics, we note clear failures for all of them. We note, for 
instance, the deceptively low %_ diagnostic of F 2 , or the fact that the largest T 2 amplitude 
for H 2 CO is larger than that for O2, despite the latter having almost an order of magnitude 
greater %TAE[T4 +T5]. The LUMO natural orbital occupation appears to track %TAE[T4 + 
T5] somewhat more consistently, although it requires the CCSD response to calculate (which 
is not available in the two most commonly used quantum chemical codes, Gaussian 03 and 
MOLPRO). 

We conclude that the simple energy-based diagnostics %TAE[(T)] and %TAE[SCF] 
which require no additional software or computational effort to calculate — are the most 
useful, at least for thermochemical applications. 

For general 'user convenience', let us address how to interpret the numerical values of 
%TAE[SCF] and %TAE[(T)] in a qualitative sense. The data in Table HXl suggest that: 

• %TAE[(T)] below 2% indicate systems dominated by dynamical correlation 

• %TAE[(T)] between 2% and about 4-5% indicate mild nondynamical correlation 

• %TAE[(T)] between 4-5% and about 10% indicate moderate, 

• and values in excess of 10% severe, nondynamical correlation 

The %TAE[SCF] data offer less detail, but as a rule of thumb, %TAE[SCF] above 
66.7% (two-thirds) suggest systems largely or wholly dominated by dynamical correlation, 
and %TAE[SCF] below 20% (one-fifth) — particularly negative values — indicate severe 
nondynamical correlation. 

Finally, in light of the fairly good correlation (without outliers) between Tg and T5 and the 
small numbers involved, we could consider adding 0.24ATAE(T 5 ) as an empirical correction 
for connected sextuple excitations. Doing so somewhat violates the general spirit of the 'Wn 
theory' family (as we are estimating a contribution to the Hamiltonian rather than explicitly 
calculating it), and we shall not report any such values (the interested reader can easily 
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obtain them using a pocket calculator and Tables HP and |VT|) . For such a putative 'W4as 
theory' ('as' for 'approximate sextuples'), error statistics for the lst-row ATcT systems 
would indeed be improved somewhat compared to regular W4: MSD=-0.008, MAD=0.059, 
RMSD=0.069 kcal/mol. (Also including the 2nd-row ATcT species, we have MSD=-0.005, 
MAD=0.059, RMSD=0.071 kcal/mol.) For a putative 'W4.2as theory', the corresponding 
lst-row error statistics are MSD=-0.016, MAD=0.049, RMSD=0.060 kcal/mol. As noted 
above, the single most notable improvement would be for ozone. 



K. Prospects for application to heavier-element systems 

Moving beyond Ar (or perhaps Ca) in the Periodic Table without sacrificing accuracy 
will entail some additional considerations. 

One of them is second-order spin-orbit coupling. It has been neglected for the systems 
considered here: for HF, HC1, F2, and CI2, however, we can compare the sum of scalar 
relativistic corrections and atomic spin-orbit coupling to relativistic corrections obtained by 
Visscher et al. j3, Ifi^ from full four-component relativistic coupled cluster calculations. In 



all cases, differences are well below 0.1 kcal/mol. However, previous work at PNNL 64J on 
bromine and iodine compounds suggested approximate second-order spin-orbit contributions 
for Br2 and I2 of 0.4 and 2.0 kcal/mol, respectively, which clearly cannot be neglected with 
impunity. 

Secondly, for such heavy elements, basis set superposition error may need to be 
accounted for in extrapolations, although small-core ECP-based basis sets may offer a better 
alternative. 

Thirdly, more sophisticated basis set extrapolation schemes may be called for, especially 
in transition metal systems. Research on this issue is currently under way in our laboratory. 

Fourthly, for more heavily multireference transition metal systems, CCSD(T) may no 
longer be adequate for geometry optimizations. 



IV. CONCLUSIONS 



We have proposed a new computational thermochemistry protocol, named W4 theory, 
for first- and second-row main group elements and validated it against benchmark values 
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obtained from the latest version of the ATcT (Active Thermochemical Tables) network. 
For key species with well-established atomization energies, W4 theory reaches an average 
accuracy better than 0.1 kcal/mol, including for such difficult species as ozone. Most systems 
for which the earlier W3 theory is feasible are amenable to W4 calculations. 

The recent CCSDT(Q) method offers a cost-effective way of estimating T4, but is 
insufficient by itself if the molecule exhibits some nondynamical correlation. The latter 
considerably slows down basis set convergence for T 4 , and anomalous basis set convergence 
in highly polar systems makes two-point extrapolation procedures unusable. However, we 
found that the CCSDTQ— CCSDT(Q) difference converges quite rapidly with the basis 
set, and that the formula 1.10[CCSDT(Q)/cc-pVTZ+CCSDTQ/cc-pVDZ-CCSDT(Q)/cc- 
pVDZ] offers a very reliable as well as fairly cost-effective estimate of the basis set limit T4 
contribution. 

The T 5 contribution converges very rapidly with the basis set, and even a simple double- 
zeta basis set appears to be adequate. The largest T5 contribution found in the present 
work is on the order of 0.5 kcal/mol (for ozone). DBOC corrections are significant at the 
0.1 kcal/mol level in hydride systems. 

Post-CCSD(T) contributions to inner-shell correlation are quite small, except in systems 
with severe nondynamical correlation. 

We further propose low-cost versions, which we denote W41ite, W3.2, and W2.2 theory. 
W3.2 theory is found to be about as reliable as the original W3 theory (at much reduced 
cost), and supersedes the latter. W2.2 theory is essentially W2w theorypj with a diagonal 
Born-Oppenheimer correction added. W41ite is intermediate in accuracy between W3.2 
and full W4, and may be applicable to some systems beyond the reach of full W4. The 
sequence Wl— >W2.2— >W3.2— >W41ite— >W4 forms a convergent hierarchy of computational 
thermochemistry methods, where all steps required in W2.2 theory can be recycled for W3.2 
theory, and all steps required in the latter can be recycled for W4 theory. 

Finally, we have considered various diagnostics for the importance of post-CCSD(T) 
correlation effects, and find the simple energy based criteria %TAE[(T)] and %TAE[SCF] 
to be the most useful for our purposes. 
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Appendix I: Effect of different reference wave functions on energies obtained from 
high-order coupled cluster methods 

For open-shell species, differences between results obtained with different reference wave 
functions are liable to cause some confusion, which we would like to address here. Some 
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illustrative data can be found in Table IVIII1 

At the full CI level with all electrons correlated, restricted and unrestricted wave functions 
should give the same answers, since the ROHF and UHF orbitals are related by a unitary 
transformation, under which FCI is invariant. 

This will no longer be the case when core orbitals are frozen: even at the FCI level there 
will be small but finite differences between the total energies. For the first-row atoms B-F, 
we found these to be in the 20-50 microhartree range. Small as these numbers may seem, 
for a polyatomic they may add up to a nontrivial discrepancy between ROFCI and UFCI 
atomization energies. 

Likewise, when core orbitals are frozen, results — even at the FCI level — become slightly 
dependent on whether one uses standard or semicanonical orbitals. 

For fully iterative coupled cluster methods such as CCSD, CCSDT, CCSDTQ,. . . , there 
will be nonzero differences between restricted open-shell and unrestricted energies due to 
spin contamination, even when all electrons are correlated. The difference tapers off to zero 
as the FCI limit is approached: for the first-row atoms B-F, the ROCCSDT — UCCSDT 
differences with all electrons correlated are already down to the 100 nanohartree range. 

For CCSDT, CCSDTQ, . . . calculations with frozen cores, the difference between 
restricted open-shell and unrestricted data will normally be dominated by the intrinsic 
ROFCI - UFCI difference, which even for CCSDT (a fortiori for CCSDTQ and CCSDTQ5) 
is several orders of magnitude larger than the spin contamination discrepancy. 

Quasiperturbative methods entail an additional complication. For instance, in the 
CCSD(T) method, the underlying CCSD is invariant to unitary rotations within the orbital 
space for both restricted open-shell and unrestricted reference orbitals, but the (T) term 
is not invariant in the case of a restricted open-shell reference. Both of the leading 
implementations of ROCCSD(T) — Watts et al.[46] as implemented in Aces II and other 
codes, and Hampel et al. as implemented in MOLPRO — involve transformation from 
standard to semicanonical orbitals. The ROCCSD(T) energies of these two implementations 
are equivalent when all electrons are correlated. However, in the case of frozen core orbitals, 
the definitions are nonequivalent: while Aces II carries all orbitals in the semicanonicalization 
and then freezes out the semicanonical core orbitals in the integral transformation and 
coupled cluster steps, MOLPRO skips the core orbitals even at the semicanonicalization 
stage j3| . The resulting CCSD(T) atomization energies are slightly different (Table IVTTT|) . 
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up to about 0.1 kcal/mol. (The ROCCSD(T) and UCCSD(T) data are rather more different, 
as expected.) 

What are the consequences of all this for a computational thermochemistry protocol 
like W4? First of all, the discrepancy between approximate ROFCI and UFCI limits in 
the valence-only steps will be offset by an (almost) identical but opposite discrepancy in the 
core-valence correction. (In practice, in a putative UW4 method a small fraction of the total 
atomization energy would move from the valence correlation to the inner-shell correlation 
contribution.) 

Secondly, the inequivalence between the (T) implementations will introduce inequiv- 
alences at three stages: two nearly equal but opposite changes in the valence (T) and 
core- valence correlation steps, respectively, and another in the valence CCSDT— CCSD(T) 
difference. If we were including a core-valence CCSDT— CCSD(T) difference as well, we 
would once again have approximate cancellation of the inequivalences (as we do find in this 
work for W4.2 theory). In W4 theory, we are neglecting the latter term, so a slightly different 
result would be obtained depending on whether one used, in the CCSDT— CCSD(T) step, 
CCSD(T) energies from ACES II or MOLPRO, or an average of the two (as in the present 
work) . 
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TABLE I: Basis set convergence of CCSDTQ-CCSDT, CCSDT(Q)-CCSDT and 
CCSDTQ-CCSDT(Q) (ATAE, in kcal/mol) 
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(a) Extrapolated from CCSDT(Q)/cc-pVTZ and CCSDT(Q)/cc-pVQZ basis sets, plus CCSDTQ/cc-pVTZ - 
CCSDT(Q)/cc-pVTZ difference 



31 



TABLE II: Contribution of connected quintuple (T5) and sextuple (Tq) excitations to the total 
atomization energies (in kcal/mol) 
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0.01 


0.01 




HF 


0.01 


0.00 


0.01 


0.00 


0.01 


0.00 


0.00 


0.00 


0.00 


0.00 


N 2 


0.11 


0.11 


0.12 


0.12 


0.11 


-0.01 


-0.01 


0.02 0.00 


0.02 


0.01 


NH 3 


0.01 


0.01 


0.01 


0.01 


0.01 


0.00 


0.00 




0.00 


0.00 


N2O 


0.19 




0.04 


0.06 


0.19 


0.14 






0.02 




NO 


0.09 


0.08 


0.11 


0.10 


0.09 


-0.02 


-0.01 




0.01 


0.01 


O2 


0.10 


0.10 


0.11 


0.11 


0.10 


0.00 


-0.01 




0.01 


0.01 


O3 


0.41 




0.51 


0.47 


0.41 


-0.09 










c 2 


0.27 


0.32 


0.47 


0.47 


0.27 


-0.20 


-0.15 


0.07 0.06 


0.07 


0.06 


BN 


0.18 


0.16 


0.11 


-0.11 


0.18 


0.07 


0.27 


0.04 0.02 


0.07 


0.01 


MgO 


-0.08 


-0.04 


-0.97 


-1.05 


-0.08 


0.88 


1.00 




-0.28 


-0.30 


BcO 


-0.13 


-0.11 


-0.72 


-0.72 


-0.13 


0.59 


0.62 


-0.03 


-0.15 


-0.16 


CN 


0.13 


0.12 


0.16 


0.14 


0.13 


-0.03 


-0.03 




0.01 




NO2 


0.19 




0.20 


0.11 


0.19 


-0.01 










Cl 2 


0.00 


0.02 


0.00 


0.02 


0.00 


0.00 


0.00 




0.00 


0.00 


C1F 


0.02 


0.02 


0.02 


0.02 


0.02 


0.00 


0.00 




0.00 


0.00 


CS 


0.06 


0.05 


0.05 


0.01 


0.06 


0.00 


0.04 


0.00 


0.00 


0.00 


H 2 S 


0.00 


0.00 


0.00 


0.00 


0.00 


0.00 


0.00 




0.00 


0.00 


HC1 


0.00 


0.00 


0.00 


0.00 


0.00 


0.00 


0.00 




0.00 


0.00 


HOC1 


0.02 




0.02 


0.03 


0.02 


0.00 






0.00 




PH 3 


0.00 


0.00 


0.00 


0.00 


0.00 


0.00 


0.00 




0.00 


0.00 


SO 


0.05 


0.06 


0.05 


0.06 


0.05 


0.00 


0.00 




0.01 


0.01 


S0 2 






0.14 




0.14 


0.00 










OCS 






-0.01 


-0.04 


0.05 


0.00 










C1CN 


0.11 




0.11 


0.12 


0.11 


0.00 










C2H4 


0.03 




0.02 


0.03 


0.03 


0.00 






0.00 




H 2 CO 


0.03 




0.04 


0.03 


0.03 


-0.01 






0.00 




HNO 


0.07 


0.08 


0.08 


0.09 


0.07 


-0.01 


-0.01 




0.01 
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TABLE III: Basis set convergence of CCSDT-CCSD(T); post-CCSD(T) core correlation 
contributions calculated with the cc-pCVTZ basis set (ATAE, in kcal/mol) and basis set 
convergence of CCSD(T) MOLPRO-ACESII differences in the TAE (kcal/mol) 





CCSDT- 


-CCSD(T) (a) 


Post-CCSD(T) core 


corr. 


CCSD(T) MOLPRO-ACESII diff. 




{PVDZ, {PVTZ, 


QUI. 


ACES 


MOLPRO 




PVDZ PVTZ PVQZ {PVDZ, 


/PVT7 




PVT71 
1 V 1 Zjj 


PV071 




AA[T 3 -(T)] AA[f 3 -(T)] AA(Q) 








PVT71 
r V 1 Zjj 


1^ V SJZjj 


H20 


n 10 
-U.lo 


n on 
-U.ZU 


A AQ 

-U.Uo 


09 
-u.uz 


m 
u.ui 


00 

U.UU 


A AO 


A AO 


A AO 

U.Uz 


A AO 


A AO 

U.Uz 


Ro 

D 2 


n 1 q 
U.lo 


a H7 
U.U / 


n 11 
-U.ll 


oa 


O OA 


O 07 
U.U / 


A AA 
U.UU 


A AA 
U.UU 


A AA 
U.UU 


A AA 
U.UU 


A AA 
U.UU 




a ftQ 
-U.Do 


a ftc 
-U.Do 


n aa 
U.UU 


n m 

-U.UI 


03 

U.UO 


01 
U.UI 


A AQ 

U.Uo 


A A/1 

U.U4 


A A/1 

U.U4 


A A/1 
U.U4 


A A/1 

U.U4 


V_/±±3 


n m 
-U.U1 


n aq 
-U.Uo 


n ai 
-U.U1 


-U.ul 


ni 

U.UI 


00 
U.UU 


A A1 
U.UI 


A AO 
U.VZ 


A AO 
U.UZ 


A AO 

U.Uz 


A AO 

U.Uz 


CH4 


-U.UD 


a H7 
-U.U I 


n ai 
-U.U1 


01 


O 01 

u.ui 


00 
U.UU 


A A1 
U.UI 


A AO 


A AO 

U.Uz 


A AO 

U.Uz 


A AO 

U.Uz 


CH 


U.lz 


A 1 O 

U. Iz 


n aa 
U.UU 


m 


01 

u.ui 


00 
U.UU 


A A1 
U.UI 


A AI 
U.UI 


A AO 

U.Uz 


A AO 

U.Uz 


A AO 


CO2 


-u.yu 


n no 

-u.yy 


n aq 
-U.Uo 


03 

-U.UO 


O OA 




A ac; 
U.UO 


A Aft 
U.UD 


A H7 
U.U ( 


a ri7 
U.U / 


A AT 
U.U ( 


CO 


-U.Ol 


-U.Ol 


n aa 
U.UU 


-u.uz 


09 

u.uz 


O 09 

u.uz 


A AQ 

U.Uo 


A A/1 

U.U4 


A A/1 
U.U4 


A A/1 

U.U4 


A At; 

u.uo 


r 2 


n Q/i 
-U.o4 


A OA 

-u..zy 


n ac; 
U.UO 


n 09 
u.uz 


0*3 
U.UO 


01 

u.ui 


A A1 
U.UI 


A AI 
U.UI 


A AI 
U.UI 


A AI 
U.UI 


A AI 

U.UI 


HF 


-U. lo 


-U.lO 


n ao 
-U.UZ 


-U.ul 


00 
U.UU 


00 
U.UU 


A AA 
U.UU 


A AI 
U.UI 


A AI 
U.UI 


A AI 
U.UI 


A AI 
U.UI 


i>2 


a fto 

-u.oy 


A ftft 
-U.DO 


n ao 


n 0^ 

-U.UO 


OA 


01 
U.UI 


A A7 
U.U I 


A OO 

u.uy 


A 1 A 
U. 1U 


A 1 A 
U.1U 


A 1 A 
U.1U 


IN 113 


a no 

-u.uy 






n a 1 * 

-U.UO 


01 

u.ui 


00 
U.UU 


A AQ 

U.Uo 


A At; 

u.uo 


A AK 
U.UO 


a ae; 

u.uo 


A AK 
U.UO 


N20 


1 QQ 

-l.oo 






n 0^ 

-U.UO 


O 07 
U.U ( 




A AQ 

U.Uo 


A 1 1 

U.ll 


A 1 O 

U. i-Z 


A 1 O 

U.lz 


A 1 Q 

U.lo 


NO 


a /i a 
-U.4y 


A A T 

-U.4 ( 


n ao 
U.UZ 


n 03 

-U.UO 


0*3 
U.UO 


O 09 

u.uz 


A Ac; 
U.UO 


A Aft 
U.UD 


A H7 
U.U / 


A AT 

U.U / 


A AT 
U.U / 


^ 2 


-U. ( 1 


A ft/1 

-U.D4 


A AT 
U.U / 


n no 

U.UU 


0*3 
U.UO 


01 

u.ui 


A AO 


A AQ 

U.Uo 


A AQ 

U.Uo 


A AQ 

U.Uo 


A AQ 

U.Uo 


^3 


1 07 






n oi^ 

U.UD 


n iq 
U.lO 




A Ac; 
U.UO 


A Aft 
U.UD 


A H7 
U.U ( 


A AT 

U.U / 


A AT 

U.U ( 


v^2 


O 1 r; 
-Z. 10 


O 1 A 

-z. iy 


A Ac; 
-U.UO 


n 1 ^ 
u. 10 


1 Q 

u.iy 


OS 
U.Uo 


A AQ 

U.Uo 


A A/1 

U.U4 


A A/1 
U.U4 


A A/1 

U.U4 


A A/1 

U.U4 


BN 


-z.oy 


O X.Q 
-Z.Do 


A A1 
U.U1 


U.U ( 


12 


12 


A A/1 
U.U4 


A A^ 
U.UO 


A A^ 
U.UO 


A A^ 
U.UO 


A Aft 

U.UD 


CN 


-U.Zo 


A 07 

-yj.z ( 


A A1 
U.U1 


-U.UO 


0*3 
U.UO 


O 03 
U.UO 


A Ac; 
U.UO 


A AT 

U.U / 


A H7 
U.U / 


A AT 
U.U / 


A AT 
U.U / 


NO2 


a oq 

-u.yo 






A A9 
-U.U.Z 


A AY 
U.U ( 




a at 
U.U / 


A AA 

u.uy 


A AA 

u.uy 


A 1 A 
U. 1U 


A 1 A 
U.1U 


CI2 


-0.35 


-0.37 


A AO 

-V.UZ 


A A9 
-U.U.Z 


A AI 
-U.UI 




0.01 


0.02 


0.02 


0.02 


A AO 

U.Uz 


C1F 


-0.28 


-0.27 


0.01 


u.u± 


0*3 
U.UO 




0.01 


0.01 


0.02 


0.01 


0.02 


cs 


A e; 
-U.Do 


A 

-U.0 / 


0.01 


-U.UO 


09 




A AO 
V.UZ 


A AQ 

U.Uo 


A AQ 

U.Uo 


A AQ 

U.Uo 


0.04 


n_2 


-U.Uo 


A f)7 
-U.U I 


A A1 

U.U1 


n on 

U.UU 


01 
U.UI 




A A1 
U.UI 


A AI 
U.UI 


A Ol 
U.UI 


A AI 
U.UI 


A AI 
U.UI 


HC1 


n in 
-U. 1U 


A 1 1 

-U.ll 


A A1 

-U.U1 


n m 

-U.ul 


00 
U.UU 




A A1 
U.UI 


A AI 

U.UI 


A AI 
U.UI 


A AI 
U.UI 


A AI 
U.UI 


HOC1 


-u.44 












A AO 
VJ.uZ 


A AQ 

U.Uo 


A AQ 

U.Uo 


A AQ 

U.Uo 


A AQ 

U.Uo 


PH, 
1 


-0.02 


0.07 


A AQ 

U.Uo 


0.02 


0.03 




0.01 


0.01 


0.01 


0.01 


A AI 
U.UI 


SO 


-0.76 


-0.74 


0.02 


0.02 


0.04 




0.02 


0.02 


0.03 


0.03 


0.03 


so 2 


-1.22 












0.04 


0.05 


0.06 


0.06 


0.06 


ocs 


-1.04 












0.04 


0.05 


0.06 


0.06 


0.06 


C1CN 


-1.07 












0.05 


0.07 


0.08 


0.08 


0.08 


C2H4 


-0.42 


-0.43 


-0.01 


-0.01 


0.03 


0.00 


0.03 


0.04 


0.04 


0.04 


0.04 


H 2 CO 


-0.48 


-0.49 


-0.01 


-0.02 


0.02 


0.01 


0.03 


0.04 


0.04 


0.04 


0.05 


HNO 


-0.52 


-0.48 


0.04 


-0.03 


0.04 


0.01 


0.05 


0.07 


0.07 


0.07 


0.08 



(a) All values were calculated using ACESII 
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TABLE IV: Basis set convergence of diagonal Born-Oppenheimer corrections at the Hartree-Fock 
level and of scalar relativistic corrections (TAE, in kcal/mol) 





DBOC 




Scalar relativistic 








Oren 


PNNL 


PNNL 




AVDZ 


AVTZ 


AVQZ AV(T+d)Z AV(Q+d)Z 


H 2 


0.12 


0.13 


-0.26 


-0.27 


-0.27 


B 2 


0.01 


0.01 


-0.06 


-0.06 


-0.06 


C2H2 


0.12 


0.12 


-0.28 


-0.28 


-0.28 


CH 3 


0.05 


0.05 


-0.17 


-0.17 


-0.17 


CH 4 


0.11 


0.10 


-0.19 


-0.19 


-0.19 


CH 


-0.08 


-0.08 


-0.04 


-0.04 


-0.04 


CO2 


0.05 


0.05 


-0.47 


-0.48 


-0.48 


CO 


0.01 


0.02 


-0.16 


-0.16 


-0.16 


F 2 


0.00 


0.00 


-0.02 


-0.03 


-0.03 


HF 


0.08 


0.08 


-0.19 


-0.20 


-0.20 


N 2 


0.02 


0.02 


-0.13 


-0.14 


-0.14 


NH 3 


0.14 


0.14 


-0.25 


-0.26 


-0.25 


N 2 


0.04 


0.04 


-0.45 


-0.46 


-0.46 


NO 


0.01 


0.01 


-0.18 


-0.19 


-0.19 


O2 


0.01 


0.01 


-0.18 


-0.18 


-0.18 


O3 


-0.03 


-0.03 


-0.24 


-0.25 


-0.25 


C 2 


0.03 


0.03 


-0.17 


-0.17 


-0.17 


BN 


0.01 


0.01 


-0.17 


-0.17 


-0.17 


CN 


0.02 


0.02 


-0.15 


-0.16 


-0.16 


NO2 


0.00 


0.00 


-0.42 


-0.43 


-0.43 


Cl 2 


0.00 


0.00 


-0.21 


-0.19 


-0.20 


C1F 


0.00 


0.00 


-0.18 


-0.17 


-0.18 


CS 


0.01 


0.01 


-0.16 


-0.16 


-0.16 


H 2 S 


0.06 


0.05 


-0.39 


-0.39 


-0.40 


HC1 


0.05 


0.04 


-0.25 


-0.25 


-0.25 


HOC1 


0.07 


0.07 


-0.32 


-0.32 


-0.33 


PH 3 


0.16 


0.15 


-0.45 


-0.45 


-0.46 


SO 


0.01 


0.01 


-0.34 


-0.33 


-0.34 


S0 2 


0.02 


0.02 


-0.81 


-0.82 


-0.83 


OCS 


0.03 


0.03 


-0.53 


-0.54 


-0.54 


C1CN 


0.04 


0.04 


-0.44 


-0.44 


-0.45 


C2H4 


0.12 


0.12 


-0.33 


-0.33 


-0.33 


H 2 CO 


0.03 


0.03 


-0.33 


-0.34 


-0.34 


HNO 


-0.05 


-0.05 


-0.27 


-0.27 


-0.27 
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TABLE V: Component breakdown of the final W4 total atomization energies at the bottom of the 
well (in kcal/mol) 



SCF valence valence f 3 -(T) T 4 f 5 inner relativ. spin-orbit DBOC (a) TAE e 







CCSD 


(T) 












shell 
















H 2 


160.02 


69.08 


3.53 


-0.20 





18 





01 


0.38 


-0 


27 


-0.22 


0.13 





02 


232.65 


C2H2 


299.87 


94.73 


8.35 


-0.72 





70 





07 


2.49 


-0 


28 


-0.17 


0.12 





04 


405.18 


CH 3 


243.40 


61.47 


1.90 


-0.03 





05 





00 


1.09 


-0 


17 


-0.08 


0.05 





02 


307.69 


CH 4 


331.55 


84.72 


2.89 


-0.08 





08 





00 


1.27 


-0 


19 


-0.08 


0.10 





02 


420.26 


CH 


57.22 


25.83 


0.89 


0.10 





03 





00 


0.14 


-0 


04 


-0.04 


-0.08 





02 


84.06 


CO2 


258.08 


116.34 


13.86 


-1.03 


1 


04 





05 


1.77 


-0 


48 


-0.53 


0.05 





07 


389.18 


CO 


181.58 


69.08 


8.02 


-0.55 





61 





05 


0.96 


-0 


16 


-0.31 


0.02 





04 


259.30 


F 2 


-31.08 


61.91 


7.63 


-0.35 





89 





03 


-0.10 


-0 


03 


-0.77 


0.00 





01 


38.14 


HF 


100.05 


39.31 


2.15 


-0.13 





10 





01 


0.18 


-0 


20 


-0.39 


0.08 





01 


141.18 


N 2 


119.69 


98.11 


9.49 


-0.79 


1 


03 





11 


0.79 


-0 


14 


0.00 


0.02 





10 


228.37 


NH 3 


203.28 


90.17 


3.89 


-0.14 





16 





01 


0.65 


-0 


25 


0.00 


0.14 





05 


297.93 


N 2 


95.13 


155.03 


18.81 


-1.51 


1 


98 





19 


1.15 


-0 


46 


-0.22 


0.04 





12 


270.20 


NO 


54.92 


87.51 


9.48 


-0.56 





84 





09 


0.41 


-0 


19 


-0.05 


0.01 





07 


152.52 


O2 


26.78 


83.95 


9.28 


-0.74 


1 


07 





10 


0.23 


-0 


18 


-0.45 


0.01 





03 


120.15 


O3 


-45.09 


163.94 


25.62 


-1.34 


3 


81 





41 


-0.05 


-0 


25 


-0.67 


-0.03 





07 


146.39 


NO2 


59.55 


147.16 


19.36 


-1.08 


1 


91 





19 


0.67 


-0 


43 


-0.45 


0.00 





10 


226.99 


Cl 2 


26.82 


28.08 


4.59 


-0.37 





44 





00 


0.14 


-0 


20 


-1.68 


0.00 





02 


57.83 


C1F 


15.41 


41.85 


5.25 


-0.30 





46 





02 


0.04 


-0 


18 


-1.23 


0.00 





01 


61.35 


CS 


104.16 


57.06 


9.68 


-0.62 





98 





06 


0.84 


-0 


16 


-0.64 


0.01 





03 


171.38 


H 2 S 


133.63 


47.66 


2.24 


-0.10 





15 





00 


0.33 


-0 


40 


-0.56 


0.05 





01 


183.00 


HC1 


80.85 


24.99 


1.48 


-0.11 





10 





00 


0.19 


-0 


25 


-0.84 


0.04 





01 


106.47 


HOC1 


86.70 


72.35 


6.77 


-0.47 





58 





02 


0.26 


-0 


33 


-1.06 


0.07 





03 


164.90 


PH 3 


173.22 


66.54 


2.01 


-0.03 





09 





00 


0.34 


-0 


46 


0.00 


0.15 





01 


241.88 


SO 


53.14 


64.15 


8.46 


-0.79 





84 





05 


0.49 


-0 


34 


-0.78 


0.01 





03 


125.30 


SO2 


121.91 


121.45 


15.83 


-1.28 


1 


56 





14 


0.97 


-0 


83 


-1.01 


0.02 





06 


258.80 


OCS 


218.25 


101.32 


14.47 


-1.09 


1 


33 





05 


1.41 


-0 


54 


-0.87 


0.03 





06 


334.37 


C1CN 


169.49 


101.51 


12.44 


-1.15 


1 


24 





11 


1.78 


-0 


45 


-0.93 


0.04 





08 


284.12 


C2H4 


434.97 


119.32 


7.40 


-0.46 





43 





03 


2.38 


-0 


33 


-0.17 


0.12 





04 


563.71 


H 2 CO 


264.83 


100.53 


7.92 


-0.53 





54 





03 


1.31 


-0 


34 


-0.31 


0.03 





04 


374.04 


HNO 


85.45 


109.46 


10.08 


-0.60 





99 





07 


0.40 


-0 


27 


-0.22 


-0.05 





07 


205.34 



(a) difference between the ACES II and MOLPRO definitions of the valence ROCCSD(T) 
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TABLE VI: Comparison between W4 total atomization energies at K, Active Thermochemical 
Tables benchmarks, and earlier reference data (kcal/mol) 





ZPVE(a) 


W2.2 


W3.2 


W41ite 


W4 


W4.2 


W4.3 


ATcT(b) unccrt. 


Earlier rcf. (c) uncert. 


H 2 


13.29 


219.39 


219.47 


219.43 


219.36 


219.35 


219.35 


219.36 





01 


219.35 





12 


C 2 H 2 


16.46 


388.61 


388.54 


388.57 


388.72 


388.72 




388.62 





07 


388.90 





24 


CH 3 


18.55 


289.12 


289.15 


289.14 


289.14 


289.14 




289.11 





03 


289.00 





10 


CH 4 


27.74 


392.52 


392.52 


392.52 


392.52 


392.52 




392.50 





03 


392.51 





14 


CH 


4.04 


79.89 


80.01 


80.01 


80.02 


80.02 


80.02 


79.98 





05 


79.90 





23 


CO2 


7.24 


381.70 


381.94 


382.06 


381.94 


381.94 




382.01 





03 


381.91 





06 


CO 


3.11 


256.01 


256.12 


256.17 


256.19 


256.18 


256.21 


256.25 





03 


256.16 





12 


F 2 


1.30 


36.15 


36.78 


36.85 


36.84 


36.87 


36.97 


36.91 





07 


36.94 





10 


HF 


5.85 


135.32 


135.39 


135.40 


135.33 


135.32 


135.30 


135.27 





00 


135.33 





17 


N 2 


3.36 


224.62 


224.92 


224.90 


225.01 


225.00 


225.07 


224.94 





01 


225.06 





04 


NH 3 


21.33 


276.62 


276.68 


276.62 


276.60 


276.59 


276.61 


276.59 





01 


276.73 





13 


N 2 


6.81 


262.44 


263.24 


263.42 


263.39 


263.40 




263.38 





03 


263.79 






NO 


2.71 


149.30 


149.66 


149.74 


149.81 


149.81 


149.86 


149.82 





02 


149.82 





03 


O2 


2.25 


117.25 


117.69 


117.77 


117.90 


117.91 


118.00 


117.99 





00 


117.97 





04 


O3 


4.15 


139.06 


142.06 


142.15 


142.24 


142.34 




142.48 





01 


142.51 






NO2 


5.40 


220.24 


221.31 


221.49 


221.59 


221.61 




221.67 





02 


221.70 






Cl 2 


0.80 


57.12 


57.03 


56.85 


57.03 


57.01 


57.08 


57.18 





00 


57.18 





00 


C1F 


1.12 


60.00 


60.17 


60.19 


60.23 


60.24 


60.29 








60.36 





01 


CS 


1.83 


169.24 


169.24 


169.10 


169.55 


169.51 


169.59 








169.41 





23 


H 2 S 


9.40 


173.59 


173.58 


173.54 


173.60 


173.60 


173.64 


173.55 





07 


173.15 





12 


HC1 


4.24 


102.32 


102.28 


102.20 


102.23 


102.22 


102.23 


102.21 





00 


102.24 





02 


HOC1 


8.18 


156.61 


156.71 


156.67 


156.72 






156.64 





43 


156.61 





12 


PH 3 


14.44 


227.36 


227.39 


227.40 


227.44 


227.47 


227.57 








227.13 





41 


SO 


1.64 


123.47 


123.53 


123.52 


123.66 


123.69 


123.76 


123.72 





02 


123.58 





04 


SOa 


4.38 


253.75 


254.37 


254.53 


254.42 






254.46 





02 


253.92 





08 


OCS 


5.72 


328.40 


328.52 


328.45 


328.65 












328.53 





48 


C1CN 


5.33 


278.73 


278.77 


278.58 


278.79 






279.42 





26 


279.20 





18 


C2H4 


31.60 


532.06 


531.99 


532.02 


532.11 


532.11 




532.00 





06 


531.91 





17 


H 2 CO 


16.53 


357.38 


357.48 


357.53 


357.51 


357.51 




357.43 





06 


357.25 





12 


HNO 


8.56 


196.27 


196.71 


196.71 


196.78 


196.78 




196.86 





03 


196.85 





06 



(a) Zero-point vibrational energies taken from the compilation in Ref. :3j, except for the following: 
CH3: D. W. Schwenke, Spectrochimica Acta A 55, 731 (1999), adjusted for expt.— calc. differences in the fundamental 
frequencies; CH4: D. W. Schwenke, Spectrochimica Acta A 58, 849 (2002), same adjustment made; H2O: 
CCSD(T)/cc-pVQZ quartic force field, fundamentals obtained by second-order rovibrational perturbation theory and 
adjusted for expt. -calc. differences in fundamentals; HOC1: K. A. Peterson, S. Skokov, and J. M. Bowman, J. Chem. Phys. 
Ill, 7446 (1999); N2O: as for H2O; O3: Obtained from experimentally derived harmonic frequencies and anharmonicity 
constants in A. Barbc, A. Chichery, T. Cours, V. G. Tyuterev, and J. J. Plateaux, J. Mol. Struct. 616, 55 (2002); N0 2 : 
from R. Georges, A. Delon, and R. Jost, J. Chem. Phys. 103, 1732 (1995); number not explicitly given there, quoted 
(presum abl y derived from their potential) by A. J. C. Varandas, J. Chem. Phys. 119, 2596 (2003). 

(b) Ref. [19J , except for H2S, SO, and SO2, which are from Ref . |2f| : the adjunct uncertainties correspond to 95% confidence 
intervals, as customary in experiment al thermochem istry, which were obtained by utilizing the full covariance matrix 
computed by ATcT; see see also Refs.QQHQ- 

(c) He I. •"' and references therein. 
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TABLE VII: Components of Wlw, W2w, W3, W2.2, W3.2, W41ite, W4, W4.2 and W4.3 



Component 


Wlw 


W2w 


W3 


W2.2 


W3.2 


W41ite 


W4 


W4.2 


W4.3 


Reference geometry 


B3LYP/ 
cc-pV(T+d)Z 






CCSD(T)/ 


cc-pV(Q+d)Z 






SCF 


AV{T,Q}+dZ 


AV{Q,5}+dZ 


AV{5,6}+dZ 


SCF extrap. 


(a) 


(a) 


(a) 


(b) 


(b) 


(b) 


(b) 


(b) 


(b) 


Valence CCSD 


AV{T,Q}+dZ 


AV{Q,5}+dZ 


AV{5,6}+dZ 


Val. CCSD extrap. 


(c) 


(d) 


(d) 


(e) 


(e) 


(e) 


(e) 


(e) 


(c) 


Valence (T) 


AV{D,T}+dZ 


AV{T,Q}+dZ 


AV{Q,5}+dZ 


Val. (T) extrap. 


(c) 


(d) 


(d) 


(d) 


(d) 


(d) 


(d) 


(d) 


(d) 


Valence T 3 -(T) 


— 




V{D,T}Z (f) 




V{D,T}Z (f) 


V{T,Q}Z (f) 


Valence (Q) 






1.25 PVDZ 




PVDZ 


PVDZ 


1.10 PVTZ PVQZ 


Valence T 4 -(Q) 






1.25 PVDZ 








1.10 PVDZ PVTZ 


Valence T5 














DZ (f) 


DZ (f) 


PVDZ 


CCSD(T) inner shell 


MTsmall 


ACV{T,Q}Z 


T 3 -(T) inner shell 
















CVTZ 


CVTZ 


Scalar relativistics 


MTsmall 


AVQ+dZ 


ZPVE 


0.985 B3LYP/ 
cc-pV(T+d)Z 


(h) 


(h) 


0) 


(i) 


0) 


0) 


(i) 


(i) 


DBOC 








HF/AVTZ 



The notation {D,T} refers to extrapolation from, in this case, cc-pVDZ and cc-pVTZ basis sets. 

(a) A + B/L 5 

(b) Martin-Karton formula 41]. 

(c) A + B/L 3 22 on valence correlation energies. 

(d) A + B/L 3 on valence correlation energies. 

(e) Separate L~ 3 and L~ 5 extrapolations on singlet and triplet coupled pairs, respectively. 

(f) CCSDT calculated with ACES II, CCSD(T) calculated with MOLPRO. 

(g) For second row systems a cc-pVDZ basis without the d functions was used. 

(h) From CCSD(T)/cc-pV(Q+d)Z quartic force field. 

(i) Best available, usually obtained from expt. fundamentals and high-level ab initio anharmonic force field (whence 
anharmonic corrections). See Table fvTI for further details. 
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TABLE VIII: Valence-only and all-electrons-correlated total atomization energies (kcal/mol) at 



the CCSDT level with different reference orbital choices, as well as using different definitions of 
CCSD(T). The cc-pCVTZ basis set was used throughout. 





CCSD(T) CCSD(T) CCSD(T) 


CCSDT 


CCSDT 


CCSDT 


CCSD(T) CCSD(T) CCSD(T) 


CCSDT 


CCSDT 


CCSDT 


Ref. 




ROHF 


ROHF 


ROHF 


UHF 


ROHF 


TTMT? 
U nr 


ROHF 


ROHF 


ROHF 


UHF 


ROHF 


Def. 




Aces 


MOLPRO 










Aces 


MOLPRO 








Orbs. 








semican. 


std. 


std. 








semican. 


std. 


std. 






All electrons correlated 








Only valence electrons correlated 






63.38 


KQ AA 
Do. 44 


fiQ AA 
DO. 44 


KQ 1« 
Do. ( D 


63.75 


63.76 


62.85 


AO OQ 

oz.yo 


AO OQ 

Dz.yo 


KQ Ol 

Do.zl 


KQ 1 O 

Do. ly 


63.21 


RN 


98.56 


OQ P7 


no c 1 ? 
yo.O ( 


C\R 1 Q 

yD. lo 


96.18 


96.18 


97.72 


07 QO 


07 Q7 

y / .0 / 


OP Q 1 

yo.oi 


OP OO 

yo. zz 


95.37 


Co 


140.37 


1 A O Q7 
14U.O ( 


1 A Q7 
14U.O / 


1 "JQ KO 

loo.Oz 


138.61 


138.62 


139.51 


1 QO AO 

ioy .ou 


1 QO «A 

ioy .04 


1 Q7 fifi 
lo / .00 


1 Q7 Pfi 

lo / .OD 


137.69 


IN 2 


217.93 


0"l 7 O/l 

Zl ( .y4 


0"1 7 O/l 

zi ( .y4 


Ol 7 Q7 


217.36 


217.37 


217.00 


01 7 1 7 

Zl i .Li 


01 1 07 

Zl / .Z / 


Ol Pfi 

Z1D.0D 


Ol QO 

zio.oy 


216.65 


CO 


OPQ flQ 

zOo.Uo 


opq oq 
ZOo.Uo 


opq oq 
zoo.Uo 


OPO fi/l 
ZOZ.04 


252.64 


252.64 


opo 1 1 

ZOZ. 11 


OPO OQ 
ZOZ.ZO 


oc;o 07 
ZOZ. Z / 


OP1 QO 
ZOl .oZ 


OP1 70 
ZOl. (U 


251.86 


C02 


377.86 




Q77 Qfi 

/ / .oD 


077 no 
f < .Uo 


377.08 


377.08 


376.18 


Q7K Qft 

1O.0O 


Q7fi A Q 

/D.4o 


Q7P P P 

1 0.00 


Q7P Qfi 


375.61 


IN 


255.66 


occ; C7 
ZOO.O ( 


zOO.O ( 


nc;,! /in 
Z04.4y 


254.49 


254.49 


254.23 


OP/1 A 7 

Z04.4 / 


oc;/i c.o 
Z04.0y 


OPQ OQ 

zOvj.zo 


OPO OO 

zoz.yy 


253.34 


Do 


114.35 


11/1 QQ 
114. 00 


11/1 QQ 
114.00 


llo.oO 


113.98 


113.85 


113.97 


11/1 OQ 
1 14. Uo 


1 1 A 1 O 
1 14. 1Z 


1 1 Q PO 

1 10. OZ 


1 1 Q P7 
llv5.0 / 


113.55 


O, 


133.25 


1 QQ OP 
lOO.zO 


1 qq op. 
loo.zO 


1 Qfi 
I0Z.0O 


132.36 


132.36 


132.86 


1 QQ Ofi 
loo.UD 


1 QQ 1 Q 

loo. lo 


1 QO OP. 
loZ.UO 


1 Q1 Q/1 
lol.o4 


132.12 


r 2 


35.05 


OO. UO 


qp op 
oO.UO 


Q/l QP 
o4.oD 


34.85 


34.85 


35.01 


QP OO 

oo.uy 


qc iq 
oO. 1U 


Q/l Qfi 
v54.oD 


Q/1 77 
o4. / I 


34.87 


HF 


137.43 


1 Q7 A Q 
lo ( .4o 


1 Q7 /I Q 
lo ( . 4o 


1 Q7 qq 
lo I .00 


137.33 


137.33 


137.21 


1 Q7 O^ 

lo / .zO 


1 Q7 OP. 

lo / .zO 


1 Q7 1 P 
lo / .10 


1 Q7 1 1 
lo ( .11 


137.16 


CN 


171.22 


1 70 QO 
1 / Z.Oy 


1 / z.oy 


1 70 OP 
1 / Z.ZO 


172.26 


172.25 


170.20 


1 71 A 
1 / 1.4D 


1 71 CQ 
1 / l.OO 


1 71 OO 

1 ( i.zy 


1 71 1 7 
1 ( 1.1 ( 


171.36 


NO 


144. 19 


144.O0 


1/1/1 qc; 
144. oO 


1 /I Q OO 

i4o.yy 


144.02 


143.99 


143.59 


1 A Q QQ 

14o.oo 


1 A Q OP 

i4o.yo 


1 A Q A Q 
14o.4o 


1 /I Q QO 

i4o.oy 


143.55 


NO2 


213.37 


01 Q PO 
zlo.OU 


01 q po 
zlo.OU 


01 77 

zlz. ( t 


212.83 


212.77 


212.36 


OIO 70 
ZlZ. IK) 


OIO 70 

ziz. ( y 


01 1 OO 

zii.yu 


Ol 1 7P 
Zll. / 


211.99 


CH 


81.94 


Q 1 O/l 


Q1 O/l 

ol.y4 


co op 
oz.UO 


82.05 


82.05 


81.77 


Q1 QO 


Q1 Q1 

ol.ol 


Q 1 OO 

oi.yu 


Q 1 QQ 


81.92 


v 1 1 , ; 


303.51 


oUo.oz 


ovo.OZ 


qoq pi 
0U0. 01 


303.51 


303.51 


oUz.oo 


QOO fifi 

oUz.DD 


QOO fiQ 

vSUz-Do 


qoo a p 
oUz.DO 


QOO fiQ 

oUz.Oo 


302.67 


v - y 2 1A 2 


oyo.Uo 


one no 

oyo.Uo 


one no 

oyo.uo 


oyo.oo 


395.53 


395.53 


on a nn 

oy4.(jy 


QO/1 1 Q 

oy4. 10 


QO/1 OO 

oy4. ZZ 


QOQ fil 

oyo.Di 


QOQ PO 

oyo.oz 


393 65 

Oi/O.OO 




554.36 


004. OO 


004. oO 


p P/i ri/i 
004. U4 


554.04 


554.04 


552.36 


c;c;o /i c; 
OOz.40 


PPO /I O 

ooz.4y 


PPO 1 
OOz.lz 


PPO OO 

OOz.uz 


552.15 


CH4 


415.31 


AA P. Q1 
410.01 


A1 P. Q1 
410.01 


/II p 07 
410.Z 1 


415.27 


415.27 


414.23 


A1 A 9£. 

^il^i.ZO 


AA A QO 


AA A. 9Q 

^il^i.ZO 


AA A. 1 & 
"4:1^. 10 


414.25 


IN XI j 


289.79 


289.79 


289.79 


289.72 


289.72 


289.72 


oon n""7 

289. U7 


289.16 


289.21 


289.08 


289.00 


289.13 


1 1 2 >J 


226. 13 


226.13 


226.13 


225.99 


225.99 


225.99 


225.68 


225.75 


225.77 


225.60 


225.53 


Zi^O.UO 


Ho CO 


365.67 


365.67 


365.67 


365.29 


365.29 


365.29 


364.43 


364.54 


364.59 


364.15 


364.04 


364.20 


HNO 


1 op 7Q 
iyo. 1 


195.73 


195.73 


195.34 


195.34 


195.34 


inr; no 

lyo.uo 


195.24 


195.31 


194.81 


194.65 


194.88 


CI9 


53.93 


53.91 


53.91 


53.62 


53.61 


53.62 


53.60 


53.67 


53.69 


54.19 


53.29 


53.41 


C1F 


56.51 


56.50 


56.50 


56.32 


56.31 


56.32 


56.31 


56.38 


56.40 


56.57 


56.08 


56.19 


CS 


164.90 


164.89 


164.89 


164.45 


164.43 


164.45 


164.06 


164.17 


164.21 


164.23 


164.61 


163.77 


H 2 S 


179.69 


179.69 


179.69 


179.64 


179.63 


179.64 


179.30 


179.37 


179.38 


179.80 


179.24 


179.33 


HC1 


105.14 


105.13 


105.13 


105.04 


105.03 


105.04 


104.91 


104.94 


104.95 


105.26 


104.81 


104.87 


PH 3 


236.74 


236.74 


236.74 


236.81 


236.81 


236.81 


236.35 


236.45 


236.46 


236.97 


236.38 


236.49 


SO 


118.40 


118.63 


118.63 


118.07 


N/A 


118.07 


117.84 


118.18 


118.20 


118.06 


117.53 


117.60 
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TABLE IX: Diagnostics for importance of nondynamical correlation 



%TAE[SCF] Tj Dj Largest T 2 %TAE[(T)] %TAE %TAE[T 4 + T 5 ] NO occupations 

diagnostic amplitudes [post-CCSD(T)] HDOMO LUMO 

- CCSD(T)/cc-pVTZ - (a) 



H 2 


68.8 


0.007 


0.011 


0.048 


1.52 


-0.005 


0.082 


1.962 


0.026 


B 2 


30.3 


0.039 


0.071 


0.286 


14.98 


2.445 


2.178 


1.830 


0.111 


C 2 H 2 


74.0 


0.013 


0.028 


0.084 (x2) 


2.07 


0.012 


0.190 


1.924 (x2) 


0.057 (x2) 


CH 3 


79.1 


0.005 


0.009 


0.036, 0.034(x2) 


0.62 


0.008 


0.018 


1.959 


0.022 


CH 4 


78.9 


0.007 


0.011 


0.035 (x2) 


0.69 


-0.001 


0.019 


1.958 (x3) 


0.022 (x3) 


CH 


68.1 


0.008 


0.017 


0.09112 (x2) 


1.05 


0.160 


0.040 


1.940 


0.020 


C0 2 


66.3 


0.018 


0.047 


0.063 (x2) 


3.57 


0.015 


0.281 


1.948 (x2) 


0.054 (x2) 


CO 


70.0 


0.019 


0.039 


0.067 (x2) 


3.10 


0.038 


0.252 


1.945 (x3) 


0.053 (x3) 


F 2 


-81.5 


0.011 


0.029 


0.169 


19.84 


1.461 


2.354 


1.904 


0.097 


HF 


70.9 


0.007 


0.012 


0.040 


1.52 


-0.013 


0.080 


1.967 


0.025 


N 2 


52.4 


0.013 


0.026 


0.095 (x2) 


4.18 


0.152 


0.498 


1.931 (x2) 


0.063 (x2) 


NH 3 


68.2 


0.006 


0.010 


0.036 


1.31 


0.011 


0.057 


1.959 


0.025 


N 2 


35.2 


0.020 


0.048 


0.086 (x2) 


6.99 


0.245 


0.804 


1.928 (x2) 


0.074 (x2) 


NO 


36.0 


0.021 


0.051 


0.113 


6.24 


0.240 


0.609 


1.944 


0.060 


2 


22.3 


0.007 


0.013 


0.094 


7.73 


0.357 


0.971 


1.950 


0.042 


O3 


-30.8 


0.027 


0.077 


0.192 


17.74 


1.954 


2.866 


1.876 


0.014 


C 2 


12.5 


0.038 


0.086 


0.293 


13.33 


0.311 


1.813 


1.629 


0.362 


BN 


-10.8 


0.073 


0.199 


0.224 


18.75 


-0.316 


2.226 


1.833 


0.157 


MgO 


-62.0 


0.051 


0.106 


0.213 


21.96 


1.118 


2.636 


1.843 


0.147 


BeO 


51.1 


0.043 


0.104 


0.041 (x2) 


8.11 


0.168 


0.368 


1.943 (x2) 


0.044 (x2) 


CN 


46.9 


0.053 


0.152 


0.092 (x2), 
0.077 (x2) 


5.85 


0.519 


0.716 


1.926 


0.063 


NO2 


26.2 


0.025 


0.065 


0.093 


8.56 


0.447 


0.923 


1.943 


0.077 


Cl 2 


46.4 


0.008 


0.021 


0.091 


7.72 


0.132 


0.749 


1.930 


0.065 


C1F 


25.1 


0.011 


0.031 


0.091 


8.40 


0.298 


0.769 


1.937 


0.066 


CS 


60.8 


0.025 


0.049 


0.092 (x2) 


5.67 


0.245 


0.604 


1.920 (x2) 


0.069 (x2) 


H 2 S 


73.0 


0.009 


0.016 


0.046, 0.045, 
0.044, 0.043 


1.22 


0.027 


0.080 


1.943 


0.039 


HCl 


75.9 


0.006 


0.011 


0.043, 0.040 (x4) 


1.38 


-0.009 


0.093 


1.951 


0.035 


HOCl 


52.6 


0.010 


0.024 


0.061 


4.08 


0.077 


0.361 


1.932 


0.064 


PH 3 


71.6 


0.013 


0.021 


0.044 (x2), 
0.038 (xl + 2), 
0.037 


0.83 


0.028 


0.039 


1.933 


0.037 (x2) 


SO 


42.4 


0.023 


0.052 


0.074 


6.73 


0.082 


0.708 


1.942 


0.043 


S0 2 


47.1 


0.021 


0.056 


0.091 


6.11 


0.163 


0.655 


1.929 


0.084 


OCS 


65.3 


0.019 


0.049 


0.067 (x2) 


4.33 


0.065 


0.392 


1.932 (x2) 


0.062 (x2) 


CICN 


59.7 


0.014 


0.028 


0.074 (x2) 


4.39 


0.069 


0.476 


1.927 (x2) 


0.063 (x2) 


C2H4 


77.2 


0.011 


0.032 


0.119 


1.32 


-0.001 


0.081 


1.916 


0.064 


H 2 CO 


70.8 


0.016 


0.045 


0.119 


2.12 


0.013 


0.154 


1.927 


0.063 


HNO 


41.6 


0.015 


0.043 


0.110 


4.92 


0.241 


0.515 


1.912 


0.086 



Percentages of the total atomization energy relate to nonrelativistic, clamped-nuclei values with inner shell electrons 
constrained to be doubly occupied. 

Di diagnostics were obtained using MOLPRO 2006. lQi- 
(a) Highest doubly occupied molecular orbital. 



